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Abstract 

Functional programming comes in two flavours: one where "functions are first-class citizens" (we 
call this applicative) and one which is based on equations (we call this declarative). In relational 
programming clauses play the role of equations. Hence Prolog is declarative. The purpose of this paper 
is to provide in relational programming a mathematical basis for the relational analog of applicative 
functional programming. We use the cylindric semantics of flrst-order logic due to Tarski and provide a 
new notation for the required cylinders that we call tables. We deflne the Table/Relation Algebra with 
operators suSicient to translate Horn clauses into algebraic form. We establish basic mathematical 
properties of these operators. We show how relations can be first-class citizens, and devise mechanisms 
for modularity, for local scoping of predicates, and for exporting/importing relations between programs. 

1 Applicative versus declarative definitions 

Some functional programming systems are applicative; others declarative. Relational programming, on the 
other hand, only exists in declarative form. In this section we explain how we use this terminology, and 
argue that relational programming should also have both declarative and applicative forms. 

1.1 In functional programming 

There are two ways for defining functions in functional programming, one using A-expressions and the 
other using equations. Consider for example the higher-order function "fzwice". It can be denoted by the 
A-expression "A/.Ax./(/a;)" ; we call this an applicative definition. This function can also be defined in a 
declarative style, that is, by asserting as true certain equations, as follows: 

twice F = g(F) 
g(F) X = F (F X) 

The characteristics of each style can be summarized in the first two columns of tabled In the applicative 
style, functions can be results of functions, can be bound to variables, and so on. Hence they are much 
like other types of values. This is not the case in the declarative style. As a result, it is sometimes said 
that in the applicative style, functions are "first-class citizens." Furthermore, it can be shown that the two 
styles have complementary strengths, namely local scoping and modularity in the applicative one, and a 
natural expression of recursion and selection in the declarative one. Thus it is valuable for a programmer 
to have both available and to be able to switch effortlessly between the two. In functional programming, 
the theory of such a combination has been developed and practical applications have been reported ^ [7] 



1.2 In relational programming 

"Relational programming" is a natural counterpart for functional programming, with relations instead 
of function as basic entities. Logic programming, the best developed form of relational programming, is 
exclusively based on Horn clauses, which constitute a declarative paradigm. As the third column of table 
n suggests, a major problem in relational programming is the absence of an applicative language in which 
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functional 


relational 


implicit 
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declarative 






explicit 






whole function/relation 


A — calculus 
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applicative 







Table 1: Classifying functional and relational definitions. 



relations can be first-class citizens. This paper develops the basis for such a language, keeping in mind the 
importance of it being first-order, and easily intertranslatable. 

2 The meaning of Horn clauses 

As basis for applicative relational programming, we use an algebraic view of the meaning of Horn clauses. 
The following example serves as introduction. Consider the clause for relational composition: p{X, Z) <— 
q{X, Y),r{Y, Z). Relation p is defined by means of operations on q, r, and the tuples of variables. We make 
these operations explicit by appealing to cylindric set algebra 01 |S] . Accordingly, a Horn clause can be 
interpreted as a relational inclusion where the right-hand side is a projection of the intersection of cylinders 
on the relations in the condition part. Consider, for example, the geometric interpretation of the clause as 
illustrated in figure ^ 

Figure 1: A geometric interpretation of relational composition: p contains the projection of the intersection 
of the cylinders on q and r. 

This can be expressed by a formula using the operators of TRA introduced in this paper. As a preview, 
we list the formula here: p D {X, Z)/{q : {X, Y) Dr : (Y, Z)), where ":" is relational application yielding a 
cylinder, "n" is intersection, and "/" is relational projection. To allows local scoping through A-binding, 
we want to have p, q, and r as relational variables. This also allows minimization operators, such as 
to apply as well. The result will be relation-valued expressions in a first-order setting. 

2.1 Relational semantics for first-order logic. 

Truth-functional semantics for first-order logic is an assignment of a truth value to a closed formula, where 
this assignment is relative to a given interpretation that assigns meanings to constants, function symbols, 
and predicate symbols. Relational semantics assigns a meaning to a formula that may have free variables; 
i.e. an open formula. It gives as meaning the relation consisting of the tuples of individuals that, if 
assigned to the free variables in the formula, would give a true closed formula according to truth-functional 
semantics. 

For example, the meaning with respect to an interpretation / of the formula q{X, Y) under relational 
semantics is the binary relation 

r = {(a, /?) is variable-free and true in/} 

For the benefit of further examples, suppose that the universe of discourse is {a, 6, c}^ and that r = 
{(a, 6), (6, c), (c,a)}. 

Note that the meaning of a relation under relational semantics is invariant under renaming of free 
variables. As a result, relational semantics is not a homeomorphism between the algebra of formulas and 

^The above footnote applies here also. 
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the algebra of relations. For example, if it were a homeomorphism, then the meaning of q{X, Y) A q{Y, Z) 
would be the intersection of the meanings of the conjuncts. Hence it would be r n r = r. However, it 
should be the composition of r with itself according to the algebra of binary relations. 
Therefore we need something else: cylindric semantics, introduced in the following. 



Tarski's cylinders. Tarski introduced a device that leads to a semantics for formulas that is a homeo- 
morphism. Consider relations that are subsets of Di x • ■ • x _D„ (call this the domain product) and consider 
a relation h (call it the base) that is a subset of Dj^ x • • • x Dj^^ where the selector, J = {ji, . . . is a 
subset of {1, ... , n}. 

Then 7ry^(fe) = {(a;i, . . . , Xn)\xi e Di, i — 1, . . . ,n and {xj-^ , ■ ■ ■ , Xj^) G 5} is the cylinder with base 6, 
domain product Di x • ■ • x D„, and selector J. A cylinder, being a set of tuples of the same length, is 
always a relation. But a relation is not always a cylinder. 

For example, take a domain product with n equal to 3, Di = D2 = D3 ~ {a, b, c}, with selector equal 
to {1,2}, and with base r — {(a, 6), (6, c), (c, a)}. Then the cylinder specified by these properties is 

{ {a,b,a), {a,b,b), {a,b,c), 

(6,c,a), {b,c,b), {b,c,c), (1) 
(c,a,a), (c,a,5), (c,a,c) } 

Rather than to assign to a formula as meaning a relation according to relational semantics, Tarski 
assigns a cylinder on this relation. He assumes an enumeration of all variables in the language. This gives 
a set J = {ji , . . . , jfe} of integers for a formula with k free variables. The semantics according to Tarski then 
assigns to the formula as meaning the cylinder determined by J as selector on the base that is the relation 
obtained by relational semantics. Such a semantics is a homeomorphism where conjunction corresponds 
to intersection. Other syntactic constructs correspond to other operations in what Tarski calls "cylindric 
set algebra." 

To return to our example, suppose X , Y, and Z are the only variables in the language (hence n — 3), 
and are enumerated in this order. Then the free variables of q{X, Y) correspond to the selector J — {1, 2}. 
According to Tarski, the meaning of q{X, Y) is then cylinder ^ The free variables of q{Y, Z) correspond 
to the selector J ~ {2, 3}. According to cylindric semantics, the meaning of q{Y, Z) is then 

{ {a,a,b), {b,a,b), {c,a,b), 

(a,6,c), {b,b,c), (c,6,c), (2) 
(a, c, a), {b,c,a), (c, c, a) } 

The meaning of q{X, Y)Aq{Y, Z) is the intersection of these two cylinders, that is, {{a, b, a), {b, a, a), (c, a, b)}. 
According to Tarski, the meaning of 3Y.q{X, Y) A q{Y, Z) is obtained by projecting this with selector 
J = {1, 3}. The result is {(a, a), (6, a), (c, b)}, which is the relational composition of r with itself. 

In this paper we define an algebra operating on cylinders and relations that allows us to translate to 
algebraic form the definite clauses of Prolog. We call it TRA, for Table/ Relation Algebra. The current 
example serves to give a preview of this translation. Let a relational composition be defined by the clause 

p{X, Z)^q{X,Y),q{Y,Z). 

In classical syntax this is 

{3Y.q{X,Y)Aq{Y,Z))Dp{X, Z). 

As we just observed, the condition has as meaning the projection on (X,Z) of the intersection of the 
cylinders denoted by q{X, Y) and q{Y, Z). In general, we shall see that a definite clause asserts, interpreted 
in terms of TRA, that the meaning of the conclusion includes the projection (determined by the variables 
in the conclusion) of the intersection of the cylinders that are the meanings of the conditions. 
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2.2 Cylinders defined as tables. 

It is rare to be able to list, as a set of tuples, a cylinder that is a meaning of a formula; the language usually 
has infinitely many variables, so that the tuples are very long. Moreover, there are often infinitely many 
tuples in the cylinder. But there is no problem specifying such a cylinder, as we only need to specify the 
base relation and the selector (the domain product is usually implicitly understood). 
For example, cylinder ^ can be specified by the set of substitutions 

{{X = a, y = 6}, {X = &, r = c}, {X = c,Y = a}}, 

as this specifies that the base relation of the cylinder is {(a, 6), (6, c), (c, a)} and that the selector is {1,2}, 
which corresponds to the variables X and Y in the enumeration. 

Sets of substitutions of this type, where the substituted variables in each are the same, play a central role 
in this paper. Observe, in the first place, that a table is a natural notation for such a set of substitutions. 
The right-hand sides make up the lines of the table, while the left-hand sides need not be repeated, hence 
can be the headings of the table's columns; see table El Because of this, we call such sets of substitutions 
"tables." 



X 


Y 


a 


b 


b 


c 


c 


a 



Table 2: A table for cylinder [J 

In the second place, observe that the set of answers to a Prolog query is a table. For example, suppose 
that the logic program P is the set of facts {q{a, b),q{b, c),q{c, a)}. The query ?- q{X, Y) produces tableEl 
as set of answer substitutions. 

3 The Table/Relation Algebra (TRA) 

TRA is an algebra of operations on relations, tables, queries, and logic programs. It facilitates translation 
to and from definite clause form, and also facilitates scoping and modularity. 

We define an n-ary relation over a Herbrand universe _ff to be a set of n-tuples of elements of H. 
Prolog's answer substitutions for the set of variables in a query can be regarded as a set of equations in 
solved form; i.e. all left-hand sides are variables that occur only there [H]. Restricting the solved forms to 
have a common set of variables as left-hand sides makes it natural to present a set of answer substitutions 
as a table. Hence we define an n-ary table to be a set of sets of equations in solved form, where each set 
of equations in the table has the same set of n variables as left-hand sides. 

The reason for calling the concept just defined "table" is that these sets of equations can be most 
economically represented in print as a table where the left-hand sides are the headings of the columns 
(analogous to the attributes of relational data models) and the right-hand sides are the entries of the table. 
A difference with the relational data model is that in our concept of table the entries can be terms of any 
complexity containing variables without any restriction. 

3.1 Tables as queries on logic programs. 

Tables, as defined above, can be obtained, in logic programming, as follows: 

Definition 1 Let P be a logic program, let Q be a query, and let T be an SLD-tree for P and Q. Then the 
expression {Q where P) has as value the table of the answer substitutions associated with all the success 
leaves of T . 
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When in Prolog a query Q fails for a logic program P, there arc no answer substitutions and the table 
{Q where P) is empty. Wc use the symbol _L (bottom) for all of the empty tables {}. We use another 
special symbol T (top) for the table consisting only of the empty answer substitution {{}}, which results 
from a successful SLD-derivation starting in a query with no variables. 

The following lemmas follow from the definition of "where" . 

Lemma 1 G (Q where P) implies that 6 is a correct answer substitution for {Q} U P. 

Lemma 2 For every correct answer substitution 9 for {Q} U P, 3 77 e (Q where P) such that 6 is an 
instance (with respect to the Herbrand universe) of rj. 

Lemma 3 The value of {Q where P) does not depend on the SLD-tree T in definition^ 
3.2 Intersection of tables 

Tables are a convenient notation for cylinders. As the intersection of cylinders is important, we need 
to define intersection between tables. Let 9i and 62 be answer substitutions to the queries Gi and G2 
respectively, and consider the query ?- Gi,G2- When Gi and G2 have a common variable, 9i U 62 is not 
in solved form and may not be solvable. However, as Colmerauer [3j observed, the solved form of 9i U 92, 
if it exists, is an answer substitution for the query ?- Gi, 62. Hence: 

Definition 2 The intersection operation of tables S and T is 

SnT'^^^ {(j){s Ut) \ 3 e S,t eT,and ^{s U t)}. 

Here ipis) means that its argument set of equations is solvable; 0(s) is only defined when s is solvable 
and then it denotes the solved form of s. For example, if relation g, as defined in a program P, is 
{(a, b), {b, c), (c, d), (d, e)}, then the result of the query ((^ q{X, Y)) where P) fl ((^ q{Y, Z)) where P) is 
table El 



X 


Y 


Z 


a 


b 


c 


b 


c 


d 


c 


d 


e 



Table 3: A table from intersection of two tables. 

The reason for choosing the name "intersection" and the symbol "fl" is given by the following 

Theorem 1 For all tables S and T, S HT is the set intersection of S and T regarded as cylinders (hence 
relations, hence sets). 

It is easy to verify the following properties about the intersection of tables: 

Theorem 2 H is associative and commutative, fl has a unique null element, which is 1- and a unique unit 
element, which is T. 

Theorem 3 For any program P and goal statements Gi, G2 and G which consists of all goals in Gi and 
G2, we have 

(Gi where P) n (G2 where P) = (G where P). 
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Modularity The previous theorem suggests using the combination of where and fl as a modular com- 
positional tool, as in the following example: 

Fi,F2) where P) n ((^ G) where Q). 

Thus we see that within the same table it can be specified of each goal with respect to which program it 
is defined. 

3.3 From relations to tables 

We often need to get a table out of a relation, rather than from a program by posing a query. For this we 
define the relational application operator We introduce its definition through a heuristic development. 
Let r be the n-ary relation {(ai, . . . , a„), (61, ... , 5„), (ci, . . . , c„)}. A table "most like" r can be easily 
constructed by adding as a heading a set of n distinct variables, say Xi, . . . , X„, as in table 0] This table 



Xi 




Xn 


ai 






h 




b„ 


Cl 







Table 4: A table "most like" r. 

can then be identified using an arbitrary predicate symbol, say p, as follows: 

p(Ai, . . . , Xn)) where {p{ei, . . . , e„) | (ei, . . . , e„) £ r}. 

The obvious generalization of allowing any terms ^i, . . . , i„ instead of the distinct variables suggests 

r : {ti, . . . ,<„) =^ p(ti, . . .,tn)) where {p{ei, . . . ,e„) | (ei, . . . ,e„) £ r}. 

Though correct, its arbitrary and auxiliary predicate p is undesirable. It is easily verifiable that this 
definition is equivalent, operationally, to: 

Definition 3 The application operation of an n-ary relation r to a tuple (ti, . . . ,tn) of terms is 
r : {ti,.. .,tn) =^ 

{0({ii = ei, . . . , t„ = e„}) I (ei, . . . , e„) G r and ipUti = ei, . . . , i„ = e„})}. 
3.4 From tables to relations 

Just as we defined the application operator to get tables from relations, we define a projection operation 
to get relations back from tables. This operation should not merely discard the table's "headings"^, as 
suggested in Instead, we present another heuristic development to define it properly. 

Let T be the previous table 2| Now, {{Xi9, . . . ,X„6') \ 9 € T} is the relation resulting from discard- 
ing the heading; i.e. r above. To generahze, we first let {ji, ■ ■ ■ ,jk} be a subset of {!,..., n}. Then, 
{{Xj-^0, . . . ,Xji^0) I e T} is a projection of r over the (ji, . . . ,jk) columns. Hence, projection is a way 
of getting a relation from a table. Secondly, allow any tuple {ti,. . . , tk) of terms instead of the variables 
{Xj^ , • ■ • , Xj^) and consider {{ti9, . . . , tk9) \ 9 G T}. Since for an arbitrary table T any G T may contain 
variables, the result must further be grounded in order to obtain a relation. Hence the final definition: 

Definition 4 The projection operation (denoted / ) of a table T over a tuple of terms {ti, . . . ,t„) is 

{ti,...,tr,)/T = ^{{{ti9,...,t,,9) \9eT}), 

where ^(a;) is the set of variable-free instances of the expression x. 



^One reason is, as mentioned before, that our table's contents include variables, whereas the elements of a relation's tuples 
are variable- free. 
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3.5 Are project and apply inverses? 

Now that we have operations from tables to relations and vice versa, one may wonder whether these are 
each other's inverses. The short answer is, in general, "no" , because 

{{tl,...,tn)/T) : {tl,...,tn) 

is not always the table T. Take, for example, the case that t\,. . . ,tn have no variables. Then the above 

expression is T whenever T is not _L. But the absence of variables in ti, . . . , t„ is a rather pathological case. 
When we add restrictions, we can say that, in a sense, "/" and ":" are each other's inverses, as shown by 
the following theorems. 

Theorem 4 For all tables T and all terms ti,. . . ,tn in which all the variables, and no other ones, in T 's 
heading occur, we have 

((ti,...,t„)/r) : (ti,...,f„) = e(r). 

For an inverse in the other direction, compare the n-ary relation r with 

(ti,...,t„)/(r : (ti,...,t„)). 

That this expression does not always equal r is shown by (c, d)/({(a, 6)} : (c, d)) = {}, where a, b, c and d 
are constants. This example suggests: 

Theorem 5 For all n-ary relations r and all terms ti, . . . , t„, we have 

(ii,...,tJ/(r:(ti,...,t„))Cr. 
However, by strengthening the restrictions, we can have equality instead of inclusion. 
Theorem 6 For all n-ary relations r and all distinct variables have 

{xi,...,Xn)/{r : {xi,...,Xn)) =r. 

3.6 Translation of definite clauses to TRA 

It should be clear now that the operators of TRA correspond closely to the operations hidden in definite 
Horn clauses, according to cylindric semantics. This simplifies the translation of definite clauses to TRA, 

and we therefore omit an explicit description of it. 

The principle of the translation is that each definite clause states that the relation denoted by the 
predicate symbol in the conclusion includes the projection (on the tuple of the terms in the conclusion) of 
the intersection of the tables denoted by the conditions. The operations of TRA are general enough to 
translate the definite clauses of pure Prolog. 

Example. We illustrate the translation by an example that is a typical Prolog program. It gives a 
quicksort program from an ordinary list to a difference list. 
The Prolog program is: 

qsort([ ] ,U-U) . 
qsort([X|Xs] ,U-W) :- 

partition (X,Xs,Yl,Y2) ,qsort(Yl,U- [X| V] ) ,qsort(Y2,V-W) . 

The TRA version is: 

qsortD{{[],U-U)/T) 

U {{[X\Xs],U -W) I (((^parf?:tton(X,Xs,ri,r2)) where P) 

n(Qsort : (ri,;/- 
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qsort is a relational variable and is not part of the language of clausal logic. But all terms, and the 
atomic formula that is the first argument of where, are in clausal logic. 

The entire expression states an inclusion between relations and may or may not be satisfied, depending 
on the value of qsort. It may be shown that there is a least relation as value for qsort that satisfies the 
inclusion. The inclusion serves as definition of this least relation. 

The applicative relational program has the following advantages: 

1. The where expression has a table as value and can be replaced by any other expression with the 
same value; see item 3. In particular, the identifier partition is local to the where expression. The 
declarative Horn clause formalism does not provide locality for predicate names. 

2. The identifier qsort is a variable and is not a predicate symbol. Hence a minimization operator, like 
/i of 0, can be applied to the entire inclusion with respect to qsort. The result is a relation- valued 
expression that can, for example, be the operator argument of the relational application The 
identifier qsort is then local to this operator. In this way any number of levels of locality can be built, 
just as in applicative functional programming. 

3. Combining a selection/minimization operator with the functional A-abstraction and application oper- 
ators yields an import/export facility for relations between program modules. As an example, replace 
P above with 

(Xorder.P) {vleq.Orderings) , 

where order is a relational variable in P used to define the order of partitioning, Orderings is a 
program that defines different orders, and 'V' is the minimization operator that selects one of the 
relations (/eg) defined in a program module {Orderings). The effect of all this is exporting the leq 
relation from program Orderings, and importing it into program P as the order of partitioning, 
yielding an ascending, or descending, qsort relation. 

4 Related work 

This paper is a continuation of a line of research going back as far as the work of Peirce and of Schroeder in 
the 19th century on algebra of relations. These algebras were not adequate to serve as basis for a semantics 
for full first-order predicate logic. In the 1930's Tarski provided cylindric set algebra, which remedied this 
shortcoming. The simplicity of definite clauses, although covered by Tarski's work, suggest the independent 
treatment given in this paper. 

After Tarski's work, the next most important step was the paper by de Bakker and de Roever By 
restricting themselves to binary relations, they took their starting point before Tarski. They elucidated the 
mechanisms of defining binary relations, especially the use of the minimization operator. 77?^ facilitates 
extending of the definition techniques of de Bakker and de Roever to include the type of algebra introduced 
by Tarski. 

There are some relations to recent work that do not seem to be part of a grand design. Codd's relational 
calculus is declarative in spirit, whereas his relational algebra is applicative in spirit. Both seem to us to 
be improved upon by the earlier work by Tarski. However, TRA can be viewed as a relational algebra 
counterpart for the Datalog query language, when substituted for Codd's relational calculus. Our where 
expressions were inspired by Nait-Abdallah's ions. The main difference is that Nait-Abdallah's work is 
syntactic, concentrating on rewriting rules. 

Our work is also related to higher-order logic programming by having the same objectives, but a 
different approach; see the concluding remarks. Finally, applicative programming constitutes important 
related work [H], as mentioned in the introduction. 

5 Conclusions 

We have characterized distinctions between the applicative and declarative paradigms in functional pro- 
gramming. We have noted that relational programming exists only in declarative form. We have developed 
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a relational algebra (TRA) useful as a mathematical basis for applicative relational programming. The 
operations of TRA are chosen in such a way that the definite clauses of Prolog have a simple translation. 

Relations can be first-class citizens. One of the most powerful paradigms in computing is "functions 
as first-class citizens" This in reaction to languages where functions are more restricted in their use 
than, say, numbers. Certain functional programming languages have demonstrated the advantages of 
having functions as first-class citizens. Prolog, the only relational programming language, does not have 
relation-valued expressions of any kind. We have shown that relations can be first-class citizens. 

Modular logic programming. We have shown how to support modularity and local scoping for rela- 
tions, which is a deficiency in declarative logic programming. We have devised a mechanism that facilitates 
the exportation and importation of relations between programs. 

Higher-order logic is not needed, at least not to provide the desirable programming features implied 
by having functions and relations as first-class citizens. In functional programming, a function is said to be 
higher-order when it takes a function as argument or produces one as result. A logic is said to be higher- 
order when one can quantify over function or predicate symbols. These two senses of "higher-order" do 
not coincide. This is proved by the existence of formalizations of A-calculus in first-order logic. A constant 
of logic can denote any individual, including a function. Therefore, a first-order variable, one ranging 
over individuals, can range over functions. In that sense, our expressions contain first-order variables that 
range over relations. This is an important point, as first-order logic is more tractable, theoretically and 
practically, than higher-order logic. 
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